1.
J Chem Inf Model ; 55(5): 972-82, 2015 May 26.
Article in English | MEDLINE | ID: mdl-25871613

ABSTRACT

Molecule and atom fingerprints, similar to path-based Daylight fingerprints, can substantially improve the accuracy of P450 site-of-metabolism prediction models. Only two chemical fingerprints have been used in metabolism prediction, so little is known about how fingerprint parameters affect site-of-metabolism predictions. It is possible that different fingerprints might yield more accurate models. Here, we study whether tuning fingerprints to specific site-of-metabolism data sets can lead to improved models. We measure the impact of 484 specific chemical fingerprints on the accuracy of P450 site-of-metabolism prediction models on nine P450 isoform site-of-metabolism data sets. Using a range of search depths, we study path, circular, and subgraph fingerprints. Two labelings are considered: standard SMILES labels, and a labeling that marks ring bonds differently from nonring bonds, enabling ortho, para, and meta positioning of substituents to be more clearly encoded. Optimal fingerprint models chosen by cross-validation performance on the full training data are, on average, 3.8% (Top-2; percent of molecules with a site of metabolism in the top two predictions) and 1.4% (AUC; area under the ROC curve) more accurate than base fingerprint models. These gains represent, respectively, a 25.6% and 16.7% reduction in error. A more rigorous assessment selects fingerprints within each cross-validation fold, sometimes selecting different fingerprints for different folds, but yielding a more reliable estimate of generalization error. In this assessment, averaging the scores from the top few fingerprints yields performance improvements of, on average, 3.0% (Top-2) and 0.7% (AUC). These gains are statistically significant and represent, respectively, a 20.1% and 8.8% reduction in error. Few consistencies were observed among the top-performing fingerprints across isoforms, with different fingerprints working best for different isoforms. These results suggest that there are important gains achievable in site-of-metabolism modeling by including and optimizing atom and molecule fingerprints. The optimal site-of-metabolism models determined by this approach are available for use at http://swami.wustl.edu/.


Subjects
Computational Biology/methods; Cytochrome P-450 Enzyme System/metabolism; Drug Discovery; Binding Sites; Cytochrome P-450 Enzyme System/chemistry; Internet; Isoenzymes/chemistry; Isoenzymes/metabolism; Reproducibility of Results
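
The Top-2 metric and the "reduction in error" figures quoted in the abstract above can be illustrated with a short sketch. It assumes that the reported gains are absolute percentage points and that error is simply one minus accuracy; the function names and the toy scores are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set


def top_k_accuracy(scores: List[Dict[str, float]],
                   sites: List[Set[str]],
                   k: int = 2) -> float:
    """Fraction of molecules with at least one observed site of metabolism
    among the k highest-scored atoms (k=2 gives the Top-2 metric)."""
    hits = 0
    for atom_scores, observed in zip(scores, sites):
        # Rank atom labels from highest to lowest predicted score.
        ranked = sorted(atom_scores, key=atom_scores.get, reverse=True)[:k]
        if observed.intersection(ranked):
            hits += 1
    return hits / len(scores)


def relative_error_reduction(base_acc: float, new_acc: float) -> float:
    """Share of the baseline error (1 - accuracy) removed by the new model.
    Assumes both accuracies are fractions in [0, 1)."""
    return (new_acc - base_acc) / (1.0 - base_acc)


# Hypothetical toy data: one dict of atom scores per molecule, plus the
# set of experimentally observed sites for that molecule.
toy_scores = [
    {"C1": 0.9, "C2": 0.2, "N3": 0.5},
    {"C1": 0.1, "C2": 0.8, "O3": 0.7, "C4": 0.3},
    {"C1": 0.6, "S2": 0.4},
]
toy_sites = [{"N3"}, {"C4"}, {"C1"}]

top2 = top_k_accuracy(toy_scores, toy_sites, k=2)
reduction = relative_error_reduction(0.80, 0.85)
```

Under these assumptions, a gain of a few accuracy points translates into a much larger relative error reduction whenever the baseline error is already small, which is consistent with the pairing of figures in the abstract.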
2.
Bioinformatics ; 31(12): 1966-73, 2015 Jun 15.
Article in English | MEDLINE | ID: mdl-25697821

ABSTRACT

MOTIVATION: Cytochrome P450s are a family of enzymes responsible for the metabolism of approximately 90% of FDA-approved drugs. Medicinal chemists often want to know which atoms of a molecule (its metabolized sites) are oxidized by Cytochrome P450s in order to modify their metabolism. Consequently, there are several methods that use literature-derived, atom-resolution data to train models that can predict a molecule's sites of metabolism. There is, however, much more data available at a lower resolution, where the exact site of metabolism is not known, but the region of the molecule that is oxidized is known. Until now, no site-of-metabolism models made use of region-resolution data. RESULTS: Here, we describe XenoSite-Region, the first reported method for training site-of-metabolism models with region-resolution data. Our approach uses the Expectation Maximization algorithm to train a site-of-metabolism model. Region-resolution metabolism data was simulated from a large site-of-metabolism dataset, containing 2000 molecules with 3400 metabolized and 30 000 un-metabolized sites and covering nine Cytochrome P450 isozymes. When training on the same molecules (but with only region-level information), we find that this approach yields models almost as accurate as models trained with atom-resolution data. Moreover, we find that atom-resolution trained models are more accurate when also trained with region-resolution data from additional molecules. Our approach, therefore, opens up a way to extend the applicable domain of site-of-metabolism models into larger regions of chemical space. This meets a critical need in drug development by tapping into underutilized data commonly available in most large drug companies. AVAILABILITY AND IMPLEMENTATION: The algorithm, data and a web server are available at http://swami.wustl.edu/xregion.


Subjects
Algorithms; Computational Biology/methods; Cytochrome P-450 Enzyme System/metabolism; Models, Molecular; Small Molecule Libraries/metabolism; Xenobiotics/metabolism; Cytochrome P-450 Enzyme System/chemistry; Humans; Molecular Docking Simulation; Structure-Activity Relationship
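
The abstract above states only that Expectation Maximization is used to train on region-resolution data. As a rough illustration of that idea, and not the XenoSite-Region implementation, the sketch below makes several assumptions of its own: a plain logistic model, exactly one metabolized atom per labeled region, atoms outside any region treated as unmetabolized, and hypothetical names throughout.

```python
import math
from typing import List, Tuple


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def predict(w: List[float], b: float, x: List[float]) -> float:
    """Logistic probability that the atom with features x is metabolized."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def em_region_train(X: List[List[float]], region_ids: List[int],
                    n_iter: int = 20, m_steps: int = 200,
                    lr: float = 0.5) -> Tuple[List[float], float, List[float]]:
    """X holds one feature vector per atom. region_ids[i] is the index of the
    metabolized region containing atom i, or -1 if the atom is in no region."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    targets = [0.0] * n  # soft labels; atoms outside regions stay at 0
    regions = sorted({r for r in region_ids if r >= 0})
    for _ in range(n_iter):
        # E-step: spread one unit of "metabolized" responsibility over each
        # region in proportion to the current model probabilities.
        p = [predict(w, b, x) for x in X]
        for r in regions:
            members = [i for i in range(n) if region_ids[i] == r]
            total = sum(p[i] for i in members)
            for i in members:
                targets[i] = p[i] / total
        # M-step: gradient descent on the cross-entropy with soft labels.
        for _ in range(m_steps):
            errs = [predict(w, b, x) - t for x, t in zip(X, targets)]
            for j in range(d):
                w[j] -= lr * sum(e * x[j] for e, x in zip(errs, X)) / n
            b -= lr * sum(errs) / n
    return w, b, targets


# Hypothetical toy data: a single feature that is 1.0 for reactive atoms.
toy_X = [[1.0], [0.0], [1.0], [0.0], [0.0], [0.0]]
toy_regions = [0, 0, 1, 1, -1, -1]
weights, bias, resp = em_region_train(toy_X, toy_regions)
```

In this toy setting the atoms outside any region all have feature value 0, so the model learns that such atoms are unlikely sites, and the responsibility within each region shifts toward the atom with feature value 1. That is the mechanism by which region-level labels can sharpen into atom-level predictions.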
3.
J Chem Inf Model ; 53(12): 3373-83, 2013 Dec 23.
Article in English | MEDLINE | ID: mdl-24224933

ABSTRACT

Understanding how xenobiotic molecules are metabolized is important because it influences the safety, efficacy, and dose of medicines and how they can be modified to improve these properties. The cytochrome P450s (CYPs) are proteins responsible for metabolizing 90% of drugs on the market, and many computational methods can predict which atomic sites of a molecule, its sites of metabolism (SOMs), are modified during CYP-mediated metabolism. This study improves on prior methods of predicting CYP-mediated SOMs by using new descriptors and machine learning based on neural networks. The new method, XenoSite, is faster to train and more accurate by as much as 4% or 5% for some isozymes. Furthermore, some "incorrect" predictions made by XenoSite were subsequently validated as correct predictions by reevaluation of the source literature. Moreover, XenoSite output is interpretable as a probability, which reflects both the confidence of the model that a particular atom is metabolized and the statistical likelihood that its prediction for that atom is correct.


Subjects
Cytochrome P-450 Enzyme System/chemistry; Molecular Docking Simulation; Neural Networks, Computer; Small Molecule Libraries/chemistry; Biotransformation; Catalytic Domain; Cytochrome P-450 Enzyme System/metabolism; Humans; Isoenzymes/chemistry; Isoenzymes/metabolism; Ligands; Probability; Protein Binding; Small Molecule Libraries/metabolism; Structure-Activity Relationship; Substrate Specificity; Thermodynamics
4.
J Chem Inf Model ; 53(12): 3352-66, 2013 Dec 23.
Article in English | MEDLINE | ID: mdl-24261543

ABSTRACT

Computational methods that can identify CYP-mediated sites of metabolism (SOMs) of drug-like compounds have become required tools for early stage lead optimization. In recent years, methods that combine CYP binding site features with CYP/ligand binding information have been sought in order to increase the prediction accuracy of such hybrid models over those that use only one representation. Two challenges that any hybrid ligand/structure-based method must overcome are (1) identification of the best binding pose for a specific ligand with a given CYP and (2) appropriately incorporating the results of docking with ligand reactivity. To address these challenges we have created Docking-Regioselectivity-Predictor (DR-Predictor), a method that incorporates flexible docking-derived information with specialized electronic reactivity and multiple-instance-learning methods to predict CYP-mediated SOMs. In this study, the hybrid ligand-structure-based DR-Predictor method was tested on substrate sets for CYP 1A2 and CYP 2A6. For these data, the DR-Predictor model was found to identify the experimentally observed SOM within the top two predicted rank-positions for 86% of the 261 1A2 substrates and 83% of the 100 2A6 substrates. Given the accuracy and extendibility of the DR-Predictor method, we anticipate that it will further facilitate the prediction of CYP metabolism liabilities and aid in in silico ADMET assessment of novel structures.


Subjects
Artificial Intelligence; Aryl Hydrocarbon Hydroxylases/chemistry; Cytochrome P-450 CYP1A2/chemistry; Molecular Docking Simulation; Small Molecule Libraries/chemistry; Aryl Hydrocarbon Hydroxylases/metabolism; Biotransformation; Catalytic Domain; Cytochrome P-450 CYP1A2/metabolism; Cytochrome P-450 CYP2A6; Humans; Hydrogen Bonding; Hydrophobic and Hydrophilic Interactions; Ligands; Protein Binding; Small Molecule Libraries/metabolism; Structure-Activity Relationship; Substrate Specificity; Thermodynamics
5.
Bioinformatics ; 29(20): 2655-6, 2013 Oct 15.
Article in English | MEDLINE | ID: mdl-23918250

ABSTRACT

SUMMARY: Scaffold network generator (SNG) is an open-source command-line utility that computes the hierarchical network of scaffolds that define a large set of input molecules. Scaffold networks are useful for visualizing, analysing and understanding the chemical data that is increasingly available through large public repositories like PubChem. For example, some groups have used scaffold networks to identify missed-actives in high-throughput screens of small molecules with bioassays. Substantially improving on existing software, SNG is robust enough to work on millions of molecules at a time with a simple command-line interface. AVAILABILITY AND IMPLEMENTATION: SNG is accessible at http://swami.wustl.edu/sng


Subjects
High-Throughput Screening Assays; Small Molecule Libraries/analysis; Data Mining; Drug Discovery; Molecular Structure; Small Molecule Libraries/chemistry; Software
6.
Bioinformatics ; 29(4): 497-8, 2013 Feb 15.
Article in English | MEDLINE | ID: mdl-23242264

ABSTRACT

SUMMARY: Regioselectivity-WebPredictor (RS-WebPredictor) is a server that predicts isozyme-specific cytochrome P450 (CYP)-mediated sites of metabolism (SOMs) on drug-like molecules. Predictions may be made for the promiscuous 2C9, 2D6 and 3A4 CYP isozymes, as well as CYPs 1A2, 2A6, 2B6, 2C8, 2C19 and 2E1. RS-WebPredictor is the first freely accessible server that predicts the regioselectivity of the last six isozymes. Server execution is fast, taking on average 2 s to encode a submitted molecule and 1 s to apply a given model, allowing for high-throughput use in lead optimization projects. AVAILABILITY: RS-WebPredictor is accessible for free use at http://reccr.chem.rpi.edu/Software/RS-WebPredictor/


Subjects
Cytochrome P-450 Enzyme System/metabolism; Software; Algorithms; Cinnarizine/chemistry; Cinnarizine/metabolism; Isoenzymes/metabolism
7.
J Chem Inf Model ; 52(6): 1637-59, 2012 Jun 25.
Article in English | MEDLINE | ID: mdl-22524152

ABSTRACT

RS-Predictor is a tool for creating pathway-independent, isozyme-specific, site of metabolism (SOM) prediction models using any set of known cytochrome P450 (CYP) substrates and metabolites. Until now, the RS-Predictor method was only trained and validated on CYP 3A4 data, but in the present study, we report on the versatility of the RS-Predictor modeling paradigm by creating and testing regioselectivity models for substrates of the nine most important CYP isozymes. Through curation of source literature, we have assembled 680 substrates distributed among CYPs 1A2, 2A6, 2B6, 2C19, 2C8, 2C9, 2D6, 2E1, and 3A4, the largest publicly accessible collection of P450 ligands and metabolites released to date. A comprehensive investigation into the importance of different descriptor classes for identifying the regioselectivity mediated by each isozyme is made through the generation of multiple independent RS-Predictor models for each set of isozyme substrates. Two of these models include a density functional theory (DFT) reactivity descriptor derived from SMARTCyp. Optimal combinations of RS-Predictor and SMARTCyp are shown to have stronger performance than either method alone, while also exceeding the accuracy of the commercial regioselectivity prediction methods distributed by Optibrium and Schrödinger, correctly identifying a large proportion of the metabolites in each substrate set within the top two rank-positions: 1A2 (83.0%), 2A6 (85.7%), 2B6 (82.1%), 2C19 (86.2%), 2C8 (83.8%), 2C9 (84.5%), 2D6 (85.9%), 2E1 (82.8%), 3A4 (82.3%), and merged (86.0%). Comprehensive data mining of each substrate set and careful statistical analyses of the predictions made by the different models revealed new insights into molecular features that control metabolic regioselectivity and enable accurate prospective prediction of likely SOMs.


Subjects
Cytochrome P-450 Enzyme System/metabolism; Isoenzymes/metabolism; Substrate Specificity
8.
IEEE Trans Pattern Anal Mach Intell ; 34(6): 1068-79, 2012 Jun.
Article in English | MEDLINE | ID: mdl-21987558

ABSTRACT

We present a bundle algorithm for multiple-instance classification and ranking. These frameworks yield improved models on many problems possessing special structure. Multiple-instance loss functions are typically nonsmooth and nonconvex, and current algorithms convert these to smooth nonconvex optimization problems that are solved iteratively. Inspired by the latest linear-time subgradient-based methods for support vector machines, we optimize the objective directly using a nonconvex bundle method. Computational results show this method is linearly scalable, while not sacrificing generalization accuracy, permitting modeling on new and larger data sets in computational chemistry and other applications. This new implementation facilitates modeling with kernels.


Subjects
Algorithms; Pattern Recognition, Automated/methods; Artificial Intelligence; Humans; Neural Networks, Computer; Support Vector Machine
9.
J Chem Inf Model ; 51(7): 1667-89, 2011 Jul 25.
Article in English | MEDLINE | ID: mdl-21528931

ABSTRACT

This article describes RegioSelectivity-Predictor (RS-Predictor), a new in silico method for generating predictive models of P450-mediated metabolism for drug-like compounds. Within this method, potential sites of metabolism (SOMs) are represented as "metabolophores", a concept that describes the hierarchical combination of topological and quantum chemical descriptors needed to represent the reactivity of potential metabolic reaction sites. RS-Predictor modeling involves the use of metabolophore descriptors together with multiple-instance ranking (MIRank) to generate an optimized descriptor weight vector that encodes regioselectivity trends across all cases in a training set. The resulting pathway-independent (O-dealkylation vs N-oxidation vs Csp(3) hydroxylation, etc.), isozyme-specific regioselectivity model may be used to predict potential metabolic liabilities. In the present work, cross-validated RS-Predictor models were generated for a set of 394 substrates of CYP 3A4 as a proof-of-principle for the method. Rank aggregation was then employed to merge independently generated predictions for each substrate into a single consensus prediction. The resulting consensus RS-Predictor models were shown to reliably identify at least one observed site of metabolism in the top two rank-positions on 78% of the substrates. Comparisons between RS-Predictor and previously described regioselectivity prediction methods reveal new insights into how in silico metabolite prediction methods should be compared.


Subjects
Cytochrome P-450 CYP3A; Models, Molecular; Acetaminophen/chemistry; Acetaminophen/metabolism; Binding Sites; Cytochrome P-450 CYP3A/chemistry; Cytochrome P-450 CYP3A/metabolism; Molecular Structure; Stereoisomerism; Warfarin/chemistry; Warfarin/metabolism
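
The abstract above mentions rank aggregation for merging independently generated predictions into one consensus ranking, but does not say which scheme is used. The sketch below uses a Borda count purely as a stand-in to show the general idea; the function name and atom labels are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


def borda_consensus(rankings: List[List[str]]) -> List[str]:
    """Merge several rankings of the same atoms into one consensus ranking.
    Each ranking lists atom labels from most to least likely site."""
    points: Dict[str, int] = defaultdict(int)
    for ranking in rankings:
        n = len(ranking)
        for position, atom in enumerate(ranking):
            # The top-ranked atom earns n points, the last one earns 1.
            points[atom] += n - position
    # Sort by descending points; break ties alphabetically for determinism.
    return sorted(points, key=lambda atom: (-points[atom], atom))


# Hypothetical rankings of three atoms from three independent models.
toy_rankings = [
    ["C7", "N1", "C3"],
    ["N1", "C7", "C3"],
    ["C7", "C3", "N1"],
]
consensus = borda_consensus(toy_rankings)
```

With these toy rankings, C7 collects 8 points, N1 collects 6, and C3 collects 4, so the consensus order is C7, N1, C3. The top two positions of such a consensus list are what a Top-2 style evaluation would then check against the observed sites.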
10.
ACS Med Chem Lett ; 1(3): 96-100, 2010 Jun 10.
Article in English | MEDLINE | ID: mdl-24936230

ABSTRACT

SMARTCyp is an in silico method that predicts the sites of cytochrome P450-mediated metabolism of druglike molecules. The method is foremost a reactivity model, and as such, it shows a preference for predicting sites that are metabolized by the cytochrome P450 3A4 isoform. SMARTCyp predicts the site of metabolism directly from the 2D structure of a molecule, without requiring calculation of electronic properties or generation of 3D structures. This is a major advantage, because it makes SMARTCyp very fast. Other advantages are that experimental data are not a prerequisite to create the model, and it can easily be integrated with other methods to create models for other cytochrome P450 isoforms. Benchmarking tests on a database of 394 3A4 substrates show that SMARTCyp successfully identifies at least one metabolic site in the top two ranked positions 76% of the time. SMARTCyp is available for download at http://www.farma.ku.dk/p450.
